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We present a new measurement of the volumetric rate of Type la supernova 
up to a redshift of 1.7, using the Hubble Space Telescope (HST) GOODS data 
combined with an additional HST dataset covering the North GOODS field col- 
lected in 2004. We employ a novel technique that does not require spectroscopic 
data for identifying Type la supernovae (although spectroscopic measurements 
of redshifts are used for over half the sample); instead we employ a Bayesian 
approach using only photometric data to calculate the probability that an ob- 
ject is a Type la supernova. This Bayesian technique can easily be modified 
to incorporate improved priors on supernova properties, and it is well-suited for 
future high-statistics supernovae searches in which spectroscopic follow up of all 
candidates will be impractical. Here, the method is validated on both ground- 
and space-based supernova data having some spectroscopic follow up. We com- 
bine our volumetric rate measurements with low redshift supernova data, and 
fit to a number of possible models for the evolution of the Type la supernova 
rate as a function of redshift. The data do not distinguish between a flat rate at 
redshift > 0.5 and a previously proposed model, in which the Type la rate peaks 
at redshift ~ 1 due to a significant delay from star-formation to the supernova 
explosion. Except for the highest redshifts, where the signal to noise ratio is gen- 
erally too low to apply this technique, this approach yields smaller or comparable 
uncertainties than previous work. 



1. Introduction 



The empi rical evidence for the existence of dark energy came from observations of Type 
la sup ernovae ( Riess et al.lll998 : Perlmutter et al. 1999 : for review, see Perlmutter and Schmidt 
20031 ). which are believed to arise from the thermonuclea r explosion of a proge nitor white 
dwarf after it approaches the Chandrasekhar mass limit (jChandrasekharl Il93ll ). However, 
the physics of Type la supernova production is not well understood. The two most plausible 
scenarios for the white dwarf to accrete the necessary mass are the single degenerate case, 
where the white dwarf is located in a binary system; and the double degenerate case, where 
two white dwarfs merge. The Type la supernova rate is correlated with the star formation 
history (SFH), and thus a measurement of the rate as a function of redshift helps constrain 
the possible type la progenitor models. 

In addition to its importance for understanding Type la supernovae as astronomical 
objects, a good grasp of the Type la supernova rate to high redshifts is important for 
the next generation of proposed space-based supernova cosmology experiments, such as 
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SNAP ( lAldering et al.ll2004l ). It is therefore of great practical interest to determine the 
rate of Type la supernovae at redshifts > 1. 

The subject of Type la supernova rates has been addressed by many authors in the past. 
E xisting rate measuremen ts have been rnostly h mited to redshift range s < 1: the results 



of ICappellaro et all fll999[ ). iHardin et all (120001) . iMadgwick et all (120031) and iBlanc et al. 



Tonrv et al.l(l2003h. and 



(2004) measure the rates at redshifts < ^0.1: lNeill et al.l (l200fil ). 
(2002), at intermediate redshifts of 0.47, 0.50, and 0.55, respectively; and iBarris and Tonry 
( 120061 ) ■ up to a redshift o f 0.75. The only published measurement of the rates at redshifts > 



Pain et al. 



1 is that of iDahlen et al.l (120041 ). who analyzed the GOODS dataset. 



There are several important differences that distinguish our work from that of lDahlen et al. 



( 120041 ). First, we augment the GOODS sample with the HST data collected during the 
Spring-Summer 2004 high redshift supernova searches. Second, our methods of calculating 
the control time (the time during which a supernova search is potentially capable of finding 
supernova candidates) and the efficiency to identify a supernova are based on a detailed 
Monte Carlo simulation technique using a library of supernova templates. Third, we adopt 
a novel approach to t yping supernovae, using p h otorn etric data and a Bayesian probability 
method described in iKuznetsova and ConnoUyl (120071 ). The Bayesian technique is able to 
perform classification using only photometric data, and therefore does not require spectro- 
scopic follow up. Optionally, photometric or spectroscopic redshifts can be used to improve 
the classification accuracy. Our initial requirements on potential supernova candidates are 
more stringent in terms of th e number of points on the light curve and the signal to noise 
of those points than those of iDahlen et al.l (120041 ): thus some of the candidates they iden- 
tified will fail our cuts. However, we are able to reliably separate Type la supernova from 
other supernovae types based on their Bayesian probability, with an efficiency that is readily 
quantifiable, thus allowing us to use larger data samples. Our approach therefore avoids the 
problems that arise in estimating the efficiency for the decision to schedule spectroscopic 
follow up based on a potentially low signal-to-noise initial detection. 

The Bayesian classification technique uses photometric data, and does not require any 
spectroscopic foUowup. This is an advantage for future large-area surveys (such as the 
Dark Energy Survey, Pan-STARRS, and LSST) that will discover thousands of supernova 
candidates, but are unlikely to be able to obtain spectroscopic data for all of them, to 
distinguish Type la supernovae from core collapse supernovae and other variable objects. 
The technique described here can be considered a prototype of the kind of analysis that could 
be performed on these future large data sets to identify Type la supernovae for cosmological 
studies. There is a clear trade-off involved in using photometric measurements alone: if 
the quality of the photometric data is poor, then the efficiency of this technique to identify 
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Type la supernovae is reduced; on the other hand, this technique enables larger samples of 
Type la from imaging surveys to be identified for cosmological studies, without the need for 
time-consuming spectroscopic follow up. 

Note that although the method is able to perform the supernova typing with photo- 
metric data alone (i.e., it does not require spectroscopic data, either redshifts or types), 
it is certainly able to use the extra information that is available, and in fact 70% of the 
supernova candidates discussed in the present work have redshifts which were obtained spec- 
troscopically. It is also worth noting that while in this paper we only analyze the Type la 
supernova rates, the Bayesian classification technique can be used to classify other types as 
well, making it possible to measure the rates of non-Type la supernovae in a similar fashion. 
These analyses will be presented in future publications. 

The paper is organized as follows. In section [2] we describe the data samples used in the 
analysis. In section [3] we describe the supernova candidate selection and typing process. In 
section 4 we calculate the control time, survey area, and search efficiency, and determine the 
volumetric Type la supernova rate from our data sample. A comparison of the rates with 
those reported in the literature is given in section O and fits of the rates to different models 
relating the Type la supernova rates to the SFH are given in section [61 A summary is given 
in section [3 



2. Data Sample 



For this analysis, we use the Hubble Space Telescope (HST) GOODS dat aset collected 
in 2002-2003 jRenzini et al.lbooi [pickinson et aUboOsI : Iciavalisco et al]l2004[ l. In addition 
to the GOODS data, we use an HST sample collected in the Spring-Summer of 2004, which 
hereafter we will call the 2004 ACS sample. The GOODS dataset consists of five epochs 
(data taking periods), separated by approximately 45 observer-frame days. The GOODS 
data used for this analysis were taken in two HST ACS filter bands: F775W (centered at 
775 nm) and F850LP (centered at 850 nm)0. Each F850LP image consists of four exposures; 
and each F775W image, of two. The GOODS survey includes two fields, GOODS North and 
GOODS South, and covers approximately 320 square arcminutes. The fields are sub-divided 
into smaller "tiles" that correspond to single ACS pointings (typically 15 or 16), as shown 
in Fig. [TJ 



The 2004 ACS supernova dataset covers only the GOODS North field, with the same 



The ACS filter transmission curves are available at 



http://acs.pha.jhu.edu/ instrument /filters/ 



- 5 - 



GOO'DS-North ' GOODS-South 




Fig. 1. — The North (left) and South (right) GOODS fields. The fields are subdivided into 
tiles, which are shown (along with their ID numbers) on the figures. The size of a single tile 
is ~11.5 sq. arcmin. 



tiling as that of the GOODS North dataset. It consists of 4 epochs separated by approx- 
imately 45 observer-frame days. The data in this sample were taken in two HST ACS 
passbands: F775W and F850LP, with one exposure for every F775W image and four for 
every F850LP image. Two teams (P I Perlmutte r and PI Riess) shared this data search- 
ing for supernovae in alternate visits; iRiess et al.l (120071 ) have published the results for the 
supernovae that were discovered in their team's visits. 

For convenience, a summary of the dataset s used is given in Table [H 

It is worth emphasizing that we are using photometric information from only two filter 
bands, providing one color measurement. The GOODS data set has been analyzed before, and 



13 ou t of 42 supernovae found were spectroscopically typed (IRiess et al.ll2004al : IStrolger et al. 



20041 ). For the 2004 ACS sample, however, the spectroscopic information is available only 
for a small fraction of the candidates. We treat both GOODS and 2004 ACS datasets in a 
consistent fashion, using photometric information only for typing supernovae (note that we 
still use spectroscopically determined redshifts where available). This allows more data to 
be searched and more supernovae to be found, but at the expense of neglecting spectroscopic 
information for the candidates where it is available. In section 13.31 we discuss in detail the 
resulting supernova candidate count. 
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Epoch 


Fiher 


Exp. time (s) 


Filter 


Exp. time (s) 


# Tiles 


Taken On 


GOODS South 


1 


F775W 


1040 


F850LP 


2120 


15 


7/31 - 8/4 (2002) 


2 


F775W 


1040 


F850LP 


2120 


16 


9/19 - 9/22 (2002) 


3 


F775W 


1040 


F850LP 


2120 


15 


10/31 - 11/3 (2002) 


4 


F775W 


1040 


F850LP 


2120 


16 


12/19 - 22 (2002) 


5 


F775W 


1040 


F850LP 


2120 


15 


2/1 - 2/5 (2003) 


GOODS North 


1 


F775W 


1120 


F850LP 


2400 


14 


11/21 - 11/22 (2002) 


2 


F775W 


1000^ 


F850LP 


2120 


17 


1/2 - 1/4 (2003) 


3 


F775W 


960 


F850LP 


2060 


16 


2/20 - 2/23 (2003) 


4 


F775W 


960 


F850LP 


2000 


14 


4/3 - 4/6 (2003) 


5 


F775W 


960 


F850LP 


2080 


15 


5/21 - 5/25 (2003) 


2004 ACS Sample (GOODS North tiles) 


1 


F775W 


400 


F850LP 


1600 


15 


4/2 - 4/4 (2004) 


2 


F775W 


400 


F850LP 


1600 


15 


5/20 - 5/23 (2004) 


3 


F775W 


400 


F850LP 


1600 


15 


7/9 - 7/10 (2004) 


4 


F775W 


400 


F850LP 


1600 


15 


8/26 - 8/28 (2004) 



'^except for tile 30 (1060) 



Table 1: A summary of the datasets used in this analysis, listing the data taking epochs, the 
filters, the exposure times of the combined exposures (in seconds), the number of GOODS 
field tiles, and the dates when the data were taken. 
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We start with the data th at have been flat-f i elded and gain-corrected by the HST 



pipeline, and use MultiDrizzle ( iFruchter and Hookl 120021 ) to perform cosmic ray rejection 



and to combine dithered observations. The parameters of the drizzling process include a 
"square" kernel, with a pixel fraction of 0.66 and a pixel scale of 1.0. The drizzling com- 
bines the multiple individual pointings. Drizzling is ineffective for the cosmic ray rejection 
for the F775W data from the 2004 ACS sample since they contain only a single exposure 
for e ach GOODS North tile. We therefore use a morphological cosmic ray rejection pack- 



age (Ivan Dokkumll200ll ) to create images with identifiable objects, thus allowing us to gener- 
ate the geometrical transformations between images; however, the original images are used 
for extracting photometric information (after verifying that no cosmic rays landed directly 
at the location of the supernova candidates). 

Supernovae are identified by subtracting a reference image from each of the HST search 
epochs. We create four distinct samples summarized in Table [21 which we use for identifying 
and performing simple aperture photometry on the supernova candidates in each of the five 
epochs in the GOODS dataset and each of the four epochs in the 2004 ACS dataset. To 
obtain the multi-epoch photometry for the GOODS North data (sample #1), we combine 
all four epochs of the 2004 ACS sample and then subtract these data from each of the five 
North GOODS epochs in turn. Combining multiple epochs for the reference image allows us 
to create deeper resulting data, which is important for extracting supernovae with the best 
possible signal-to-noise ratio (SNR). For sample #2, we combine the entire North GOODS 
sample and subtract these data from each of the four 2004 ACS epochs in turn. Because 
the GOODS and 2004 ACS data were taken with a time separation of approximately a 
year, these samples should be sensitive to the supernovae that were both on the rise and 
on the decline during the GOODS and 2004 ACS data taking period for samples #1 and 
#2, respectively. For the GOODS South sample, however, we do not have any additional 
datasets, and are thus forced to separate the sample into two. This is the reason the three 
initial data samples (GOODS North and South and the 2004 ACS data set) result in four 
search samples. We combine South epochs 4 and 5 for sample #3, and epoch 1 and 2 for 
sample #4; we then subtract the two combined samples separately from each of the five 
South GOODS epochs. If a supernova candidate has been found in both samples #3 and 
#4, we consider it belonging to the sample in which it had an epoch with the largest SNR. 
This avoids any possible double- counting of the candidates for the GOODS South data. 
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Sample ^ 


Reference Dataset 


Supernova Search 


1 
2 
3 

4 


Combined 2004 ACS data (4 epochs) 
Combined North GOODS data (5 epochs) 
Epochs 4+5 of the South GOODS data 
Epochs 1+2 of the South GOODS data 


Individual North GOODS epochs 
Individual 2004 ACS dataset epochs 
Individual South GOODS epochs 
Individual South GOODS epochs 



Table 2: The samples used in our supernova search. To identify and extract photometry for 
supernova candidates, we subtract the data listed in column 2 from the data listed in column 
3. Note that sample #2 has the deepest references. 

3. The Supernova Candidate Selection and Typing 

The search for supernova candidates and their subsequent typing as la's is a 3-stage 
process. We will briefly describe them below, and then in detail in sections I3.1H3.3I 

1. First, potential supernova candidates in individual epochs are identified by the software 
that is used to subtract the supernova search data from the reference data. The initial 
candidate selection is done using the F850LP data only because it suffers less from 
cosmic ray contamination, and because F850LP covers supernovae at redshifts up to 
~1.5. The initial supernova selection is primarily directed toward reducing the number 
of false positives resulting from various image processing artifacts and residual cosmic 
ray contamination. It is followed by a manual scan to reject any obvious remaining 
cosmic rays and image processing artifacts. Note that both sources of false detections 
have specific signatures that real supernovae do not have; this selection therefore is 
not expected to reduce the number of real supernovae in the sample. This stage is 
described in detail in section 13. 1[ 

2. For the candidates on individual epochs that pass the first stage of the selection process, 
we extract the photometric information at the candidate locations in the multi-epoch 
F850LP and F775W data. We then select candidates with reasonably well-measured 
light curves by requiring that the candidate's SNR in the subtracted data (in both 
filters) be greater than 2 for at least 3 search epochs, including at least two with a SNR 
greater than 3. At the end of this stage, we are left with the majority of candidates 
that are presumed to be supernovae of some type, as well as some candidates that 
cannot be modeled as any known supernova type. This stage is described in detail in 
section 13. 2[ 

3. The final step applies a Bayesian likelihood technique that assigns each candidate that 
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passed steps 1 and 2 a probability to be a Type la supernova based on the multi-epoch 
data in both filters. This stage is described in detail in section 13.31 



For convenience, we summarize the selection process in Table |3l We now describe each of 
the selection stages in detail. 



Selection Stage 


Data Used 


Cuts Applied 


1 


F850LP 
single (discovery) epoch 


SNRcxposure > 3 in 4 exposures 
SNR consistency in 3 out of 4 exposures 
Percent increase > 15% in combined exposures 
shape cuts in combined exposures 


2 


F850LP, F775W, all epochs 


> 3 epochs with S/N > 2 
(including > 2 epochs with S/N > 3) 


3 


F850LP, F775W, all epochs 


Bayesian Type la classification 



Table 3: A summary of the Type la supernova selection and typing process. The meaning 
of the cuts is explained in the text describing the corresponding stages. 



3.1. Stage 1: Single Epoch Supernova Candidate Selection 

In the first step of the supernova search, we search for supernova candidates in the 
individual epochs of the F850LP data by looking for signals in the reference-subtracted search 
images. The reference image is the same for each exposure (recall that each F850LP image 
consists of four exposures, each with the same exposure time). We use aperture photometry 
with a radius of 3 pixels, where the pixel scale is 0.03" (after drizzling). This choice of 
the aperture optimizes the SNR of supernova candidates. We verified that the photometric 
extraction procedure is working well by creating "fake" supernovae, as described later in this 
section, and comparing their input and output magnitudes; they agree at the sub-percent 
level. The procedure for identifying supernovae is as follows. 

i. Subtracting the combined (drizzled) exposures of the search data from the (drizzled) 
reference data, we require that: 

• The absolute value of the fiux within the supernova candidate's aperture in the 
subtracted data divided by the fiux in the reference data (the "percent increase" 
variable) be > 15%. 
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• The candidate's shape in the subtracted data must be consistent with a point 
source: we require that the candidate's FWHM in both x- and y- directions be < 
4 pixels, and that the absolute value of its normalized xy moment be < 0.5 pixels. 

ii. Next, to eliminate false detections resulting from cosmic rays, we do the following: 

• We consider the four individual exposures of the search images. The SNR mea- 
sured for a supernova candidate in each of these exposures (SNRexposure) should 
be at least 3. A false positive resulting from cosmic rays will likely not be present 
in every individual exposure. 

• We then subtract each of the individual exposures from the reference image at 
the location of the supernova candidate and compare the signal to the quadratic 
sum of the noise. The difference in these SNRs between the exposures must be 
< 3 for at least 3 out of 4 exposures. We are thus allowing one (and only one) of 
the four exposures of the search image to be contaminated by a cosmic ray. 

These cuts eliminate close to 90% of false detections {i.e., the number of detections 
decreases from ~100 per single tile (see Fig. [T]) to ~10). Obvious image processing artifacts 
or cosmic rays that manage to pass these cuts are rejected by manual screening (typically, 
there would be a few such candidates per tile, mostly image processing artifacts), with any 
questionable candidates left in the sample. The efficiency of the manual scan has been 
checked using a sample of ~100 fake supernovae, generated as described below, and 100% 
were correctly identified. The preliminary selection flags any variable objects - supernovae 
of various types, as well as active galactic nuclei (AGNs), etc. In section [3731 we describe 
our approach to selecting Type la supernovae from the sample. 

In order to measure the efficiency of the selection, we used a Monte Carlo simulation that 
puts fake supernovae on real F850LP images. Fake supernovae were also used to develop the 
se lection cuts hsted above in an unbiased way. The technique follows the approach outlined 
in 



Pain et al.l (119961 ) and works as follows. 



First, we run SExtractor (IBertin and Arnoutd Il996l ) v2.3 on the search images that 



have been combined, or drizzled, together from the individual exposures. We do this for a 
number of both North and South GOODS tiles. Using SExtractor's classification of objects 
as galaxies and stars, we create a list of the galaxy positions on the image. Because in our 
analysis we are ignoring candidates near image edges, the galaxies located within 2 galaxy 
full widths at half maximum (also determined by SExtractor) from the image boundaries 
are discarded. The fake supernova that is to be put on the image is randomly assigned 
a magnitude that is drawn from a fiat distribution between 23 to 30. The supernova's 
position is drawn from a Gaussian distribution with half the galaxy's full width at half 
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maximum as the standard deviation and centered on the galaxy's nominal center. We then 
use STSDAd^l tranback function to convert the fake supernova positions on the drizzled 
images into coordinates on the raw i ndividual exposures. Fake supernovae themselves are 



created using the TinyTim software (IKrist and Hookll2004l ). for the ACS WFCl camera, in 



filter F850LP. The fake supernova signal, combined with a noise generated using a Poisson 
distribution with the signal's mean for each pixel, is added onto the input exposures, which 
are subsequently processed in exactly the same way as real data are. 

We generated ~13,000 fake supernovae (the 100 supernovae used for the check of the 
manual scanning efficiency were a subset of this sample) . The fake supernovae that pass the 
stage 1 selection cuts described above are compared with the input list of fakes. This allows 
us to calculate the efficiency of the selection cuts for the preliminary supernova selection. 
This efficiency is shown in Fig. |2] (upper left) as a function of the candidates' SNR, and in 
Fig. [2] (upper right) as a function of the candidates' magnitude, on the reference-subtracted 
search images. Note that our reference images are not uniformly deep: they consist of 2, 
4, or 5 combined epochs, depending on the tile of the GOODS field and the supernova's 
position on the tile (see Fig. [T]). Figure [2] (lower left) shows the supernova finding efficiency 
as a function of the SNR for two representative cases: i) for all locations where two epochs 
contribute to the reference data; and ii) for all locations where there are four epochs that 
are available for the reference data. We refer to these "depth 2" and "depth 4", 

respectively. It is evident that, within errors, for a given SNR, the efficiency is independent 
of the depth of the reference image at the location of the fake supernovae, as it should be. 
We thus use the efficiency curve in Fig. |2] (upper left) that combines all of the depths, which 
we fit to the following four-parameter function: 

e{SNR) =pi + — r (1) 

where we obtain pi = 0.96, p2 = —18.04, = 0.41, and p4 = —1.34. The resulting fit is also 
shown in Fig. [2] (upper left). 

One concern in supernova searches is the potential loss of candidates located close to 
the core of their host galaxies. Figure [2] (lower right) shows the efficiency as a function of 
the supernova's distance from the galaxy core. It is apparent that the efficiency remains 
essentially flat. 



^STSDAS and PyRAF are products of the Space Telescope Science Institute, which is operated by AURA 
for NASA 
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30 



Fig. 2. — The efficiency of the stage 1 supernova selection in the reference-subtracted search 
images. The errors are assigned using binomial statistics. The upper left plot shows the 
combined efficiency for all depths (see text for the definition of depth) of the reference image 
as a function of the candidate's SNR. Overlaid as a solid line is the fit of the efficiency to 
the function in Eqn. [H The upper right plot shows the same as a function of the candi- 
dates' magnitude. The lower left plot shows the efficiencies for two representative depths: 
2 (squares) and 4 (down turned triangles). The lower right plot shows the efficiency as a 
function of the supernova distance from the host galaxy core for all candidates with SNR > 
15; the insert shows the efficiency for the candidates with SNR < 15. 
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3.2. Stage 2: The Multi-Epoch Selection 



The second stage of the supernova candidate selection is where we turn to the multi- 
epoch photometric data in both filters. Subtracting the stacked image of each epoch of the 
search data from the reference data, we calculate the candidates' SNRs in the subtracted 
data and require that there be at least three epochs with a SNR > 2, including at least two 
epochs with the SNR > 3. These cuts are designed to select candidates with reasonably well- 
measured light curves. Because the Bayesian technique described in 13.31 provides a powerful 
discrimination of Type la supernovae, these cuts can be very loose. At the end of stage 2, 
we have 26 candidates in sample 7^1, 17 candidates in sample 7^2, 9 candidates in sample 
7^3, and 5 candidates in sample 7^4, for a total of 57 candidates. A list of these candidates 
is given in Tables HI O O and [3, for samples #1, #2, #3, and #4, respectively. The tables 
specify the supernova n ames and classificat ions (gol d, silver, or bronze ) for the candidates 
that were also found in iRiess et al.l (l2004al ) and in iRiess et al.l (120071 ). The classification 
refers to the degree of belief in the typing of the candidate, with gold being certain. For 
sample 7^1, we have 8 gold and 1 silver la's, and 3 silver core-collapse (CC) supernovae. For 
samples 7^3 and 7^4, we have 5 gold and 2 silver la's, and 1 gol d and 1 silver CC. There were 
6 additional gold and silver la's found in iRiess et al.l (l2004al ) that failed our stage 2 cuts 
(SN-2003eu, SN-20021g, SN-2002fx, SN-2003ak, SN-2003eq, and SN-2003al) because they 
did not have a sufficient number of epochs with high enough SNR. In other words, these 
candidates fall below the threshold that is intentionally set high enough that an automated 
Bayesian classification of candidates (discussed in section 13.31) may be possible. Note also 
that the failure of real SNe la to pass stage 2 cuts is taken into account in the control time 
calculation (Section l4.ip . 

Spectro scopic redshift s (of t h e host, the SN or bo th) were tak e n fro m the foUow- 



mg so u rces 



Stroker et al.l fl2004j). iRiess et al.l (l2004al). ICo 



kooi \. IWirthI J2004j ). lie Fevre et al.l toO^ . IVanzella et al.l kood \. and kiess et al.l J2OO7I ). 



len et al.l (l2000l). ICowie et al. 



In some cases, the spectroscopic redshift has been determined more than once. We find good 
agreement in such cases. I f a sp e ctroscopic redshift w as not available, we used photomet ric 
redshifts from IWolf et all J2004j ) . Istroker et al.l J2004I ). and iMobasher and DahlenI J2004j ). 



The host galaxies of three candidates (candidates 7^9 and 7^25 in Table H] and candidate 
7^12 in T able ED were observed w ith the Subaru Faint Object Camera and Spectrograph 
(FOCAS; kashikawa et al.l (120021 )) on May 17, 2007 All three host galaxies were observed 
with the 300R grism and the S058 order sorting filter, resulting in spectra covering the 
5800-10000 A spectral region with a resolving power of ~ 300. Single emission lines were 
detected in the first two galaxies. If these lines are due to the [Oil] doublet at 3727 A, then 
the redshifts of these sources are z = 1.143 ± 0.001 and z = 0.618 ± 0.001, respectively. 
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The first measurement confirms the redshift reported in IStrolger et al.l (120041 ) . The second 
measurement is new. Although the continuum of the third galaxy was detected, no clear 
spectral features are apparent, so we used the photometric redshift instead. 

The typical error in the redshift that is measured spectroscopically is 0.001, if the 
redshift was determined from host galaxy lines, or 0.01, if the redshift was determined from 
supernova features. For photometric redshifts, the error is larger, ranging from 0.05 to as high 
as 0.4. The source of the redshift errors is listed in the tables as well. Precision photometric 



measurements for previously unpublished candidates will be made available in ISuzuki et al. 



(|2008|) 



Cand 


HA (2000.) 


Dec (2000.) 


Redshift 


Error 


Source 


Ref 


Comment 


P(la|{Da,2) 


1 


12 


37 


06 


938 


+62 


09 


15 


81 


0.53 


0.25 


host phot 


M&D 




1.0 


2 


12 


37 


01 


537 


+62 


11 


28 


66 


0.778 


0.001 


host spec 


C04 




1.0 


3 


12 


36 


56 


336 


+62 


11 


55 


65 


0.83 


0.10 


host phot 


M&D 




1.0 


4 


12 


37 


49 


350 


+62 


14 


05 


71 


0.41 


0.01 


spec 


S04 


silver CC (SN-2002kl) 


0.3 


5 


12 


36 


21 


291 


+62 


11 


01 


24 


0.633 


0.001 


host spec 


TKRS 




0.9 


6 


12 


37 


08 


396 


+62 


14 


23 


98 


0.564 


0.001 


host spec 


TKRS 




0.9 


7 


12 


37 


40 


658 


+62 


20 


07 


42 


0.741 


0.001 


host spec 


TKRS 




1.0 


8 


12 


36 


16 


850 


+62 


14 


37 


30 


0.71 


0.05 


host phot 


S04 


bronze la (SN-20021ih) 


1.0 


9 


12 


37 


28 


421 


+62 


20 


39 


56 


1.141 


0.001 


(host + SN) spec 


R04 


gold la (SN-20021ii) 


1.0 


10 


12 


36 


38 


130 


+62 


09 


52 


88 


0.513 


0.001 


host spec 


S04, TKRS 


silver CC (SN-2003bc) 


0.0 


11 


12 


37 


25 


126 


+62 


13 


16 


98 


0.67 


0.01 


SN spec 


R04 


gold la (SN-2003bd) 


1.0 


12 


12 


36 


24 


506 


+62 


08 


34 


84 


0.954 


0.001 


host spec 


S04, TKRS 


silver CC (SN-2003bb) 


0.8 


13 


12 


36 


27 


828 


+62 


11 


24 


71 


0.66 


0.05 


host phot 


S04 


bronze CC (SN-2003ew) 


1.0 


14 


12 


37 


19 


723 


+62 


18 


37 


23 


1.27 


0.01 


SN spec 


R04 


gold la (SN-2003az) 


1.0 


15 


12 


37 


15 


208 


+62 


13 


33 


55 


0.899 


0.001 


(host + SN) spec 


R04, TKRS 


gold la (SN-2003eb) 


0.0 


16 


12 


36 


55 


441 


+62 


13 


11 


46 


0.954 


0.001 


(host + SN) spec 


R04, TKRS 


gold la (SN-2003es) 


1.0 


17 


12 


36 


33 


179 


+62 


13 


47 


34 


0.54 


0.05 


host phot 


S04 


bronze la (SN-2003en) 


0.9 


18 


12 


36 


57 


900 


+62 


17 


23 


24 


0.529 


0.001 


host spec 


TKRS 




1.0 


19 


12 


36 


39 


967 


+62 


07 


52 


12 


0.48 


0.05 


host phot 


S04 


bronze CC (SN-2003dz) 


0.9 


20 


12 


36 


31 


772 


+62 


08 


48 


25 


0.46 


0.05 


host phot 


S04 


bronze CC {SN-2003dx) 


0.0 


21 


12 


37 


28 


992 


+62 


11 


27 


36 


0.935 


0.001 


host spec 


S04, TKRS 


silver la (SN-20031v) 


N/A 


22 


12 


37 


09 


189 


+62 


11 


28 


17 


1.340 


0.001 


(host + SN) spec 


R04, TKRS 


gold la (SN-2003dy) 


1.0 


23 


12 


37 


12 


066 


+62 


12 


38 


04 


0.89 


0.05 


host phot 


S04 


bronze CC {SN-2003oa) 


0.4 


24 


12 


36 


15 


925 


+62 


12 


37 


38 


0.286 


0.001 


host spec 


S04, TKRS 


bronze CC (SN-2003ba) 


N/A 


25 


12 


36 


26 


718 


+62 


06 


15 


16 


0.618 


0.001 


host spec 


this paper 




N/A 


26 


12 


36 


26 


013 


+62 


06 


55 


11 


0.638 


0.001 


(host + SN) spec 


R04, TKRS 


gold la (SN-2003be) 


1.0 



Table 4: The candidates selected at the end of stage 2 for sample #1. Listed are the can- 
didates' coordinates, redshifts, errors on the redshifts, the sources used for the redshift and 
redshift error determination, the references for the sources, and P{la\{Di}, z) defined in sec- 
tion l3.3l (the "N/A" stands for a special category of c andidates de s ignate d as "anomalies", as 



described in section |373|) . For the candidates found in lRiess et al.l (l2004a), the tables a lso lis t 



the sup ernovae' name and cl a ssifica tion (gold , silver, or br o nze). COO is 



Cohen et all J200oh 



H03 i s iHornschemeier et al.l (120031), C04 is ICowie et all km^ VVD S is li e Fevre et al. 



( 2004[) M&D is lMobasher and DahlenI (120041) . S04 islStroker et a.. 
J2004ah. TKRS islwirthl J2004h . W04 is lWolf et al.l hm^ . F2 is 



R07 is Riess et al. 



(120041). R04 is iRiess et al. 
Vanzella et aP (|2006|), and 



(120071 ). 



Note that the redshifts of candidate #17 in Table [5] and candidate #4 in Table [6] are 
uncertain, since the assumed host galaxies of the supernova candidates are 7" and 4" away. 
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Cand 


RA (2000.) 


Dec (2000.) 


Rcdshift 


Error 


Source 


Rcf 


Comment 


P(Ia|{Da,z) 


1 


12 


36:20.889 


+62 


10 


19 


24 


1.10 


0.28 


host phot 


M&D 




1 





2 


12 


36:29.474 


+62 


11 


41 


40 


1.35 


0.40 


host phot 


M&D 










3 


12 


36:19.901 


+62 


13 


47 


67 


0.535 


0.001 


host spec 


TKRS 










4 


12 


36:27.131 


+62 


15 


09 


27 


0.794 


0.001 


host spec 


TKRS 










5 


12 


36:32.238 


+62 


16 


58 


38 


0.437 


0.001 


host spec 


TKRS 







4 


6 


12 


38:03.689 


+62 


17 


12 


23 


0.280 


0.001 


host spec 


C04 







4 


7 


12 


37:09.495 


+62 


22 


15 


37 


1.61 


0.34 


host phot 


M&D 




1 





8 


12 


37:06.772 


+62 


21 


17 


46 


0.406 


0.001 


host spec 


TKRS 







1 


9 


12 


36:26.694 


+62 


08 


29 


74 


0.555 


0.001 


host spec 


TKRS 







8 


10 


12 


36:54.125 


+62 


08 


22 


21 


1.39 


0.01 


SN spec 


R07 


gold la (HST04Sas) 


1 





11 


12 


36:34.363 


+62 


12 


12 


55 


0.457 


0.001 


(host + SN) spec 


TKRS, R07 


gold la (JST04Yow) 


1 





12 


12 


37:33.918 


+62 


19 


21 


75 


0.88 


0.38 


host phot 


M&D 




1 





13 


12 


36:34.853 


+62 


15 


48 


86 


0.855 


0.001 


(host + SN) spec 


R07, TKRS 


gold la (HST04Man) 


1 





14 


12 


36:36.009 


+62 


17 


31 


97 


0.60 


0.15 


host phot 


M&D 







2 


15 


12 


36:55.214 


+62 


13 


03 


75 


0.952 


0.004 


(host + SN) spec 


COO, R07 


gold la (HST04Tha) 


1 





16 


12 


37:48.435 


+62 


13 


34 


85 


0.839 


0.001 


host spec 


TKRS 




1 





17 


12 


36:01.542 


+62 


15 


55 


16 


0.086 


0.001 


host spec 


H03 




N/A 



Table 5: Same as Table H] for sample #2. The redsliift of candidate 7^17 is uncertain, as the 
possible host galaxy is 7" away. Leaving this redshift as unconstrained does not change our 
results. 



Cand 


RA (2000.) 


Dec (2000.) 


Rcdshift 


Error 


Source 


Ref 


Comment 


F(Ia|{I3,},z) 


1 


03:32:18.072 


-27:41:55.83 


0.88 


0.05 


host phot 


S04 


silver la (SN-2002fy) 


0.9 


2 


03:32:13.002 


-27:42:05.75 


0.421 


0.001 


host spec 


VVDS 




0.0 


3 


03:32:37.511 


-27:46:46.40 


1.30 


0.01 


SN spect 


R04 


gold la (SN-2002fw) 


1.0 


4 


03:32:05.060 


-27:47:02.96 


0.976 


0.001 


host spec 


VVDS 




N/A 


5 


03:32:17.309 


-27:46:23.74 


0.13 


0.01 


phot 


W04 




0.0 


6 


03:32:48.598 


-27:54:17.14 


0.841 


0.001 


host spec 


S04, VVDS 


silver CC (SN-2002fz) 


0.9 


7 


03:32:22.751 


-27:51:09.65 


Unconstrained 


Unconstrained 


phot 


S04 


bronze CC (SN-2002fv) 


0.0 


8 


03:32:42.441 


-27:50:25.08 


0.58 


0.01 


spec 


S04 


gold CC (SN-2002kb) 


N/A 


9 


03:32:38.082 


-27:53:48.15 


0.987 


0.001 


host spec 


S04, VVDS 


bronze la (SN-2002ga) 


1.0 



Table 6: Same as Table H] for sample #3. The redshift of candidate #4 is uncertain, as the 
possible host galaxy is 4" away. Leaving this redshift as unconstrained does not change our 
results. 



Cand 


RA (2000.) 


Doc (2000.) 


Redshift 


Error 


Source 


Ref 


Comment 


P(la|{13,},2) 


1 


03:32:24.782 


-27:46:18.07 


1.306 


0.001 


host spec 


R04, F2 


gold la (SN-2002hp) 


1.0 


2 


03:32:22.522 


-27:41:52.26 


0.526 


0.001 


(host + SN) spec 


R04 


gold la (SN-2002hr) 


1.0 


3 


03:32:22.318 


-27:44:27.04 


0.738 


0.001 


host spec 


R04, F2 


gold la (SN-2002kd) 


1.0 


4 


03:32:05.382 


-27:44:29.76 


0.91 


0.05 


host phot 


S04 


silver la (SN-2003al) 


1.0 


5 


03:32:34.648 


-27:39:58.18 


0.214 


0.001 


(host + SN) spec 


R04, VVDS 


gold la (SN-2002kc) 


1.0 



Table 7: Same as Table H] for sample #4. 
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respectively. However, we have verified that if we leave the redshifts of these candidates as 
unconstrained, it does not affect our final results. 



3.3. Stage 3: The Identification of Type la Supernovae 



The candidates that have been selected in stages 1 and 2 are assumed to be real transient 
objects, most likely supernovae, and must now be classified by type. With only scarce pho- 
to metric data available, we t u rn to the Bayesian method of classifying supernovae described 
in 



Kuznetsova and ConnoUyi (120071 ). 



Pho tometric typing of supernov ae h as been described inlPoznanski et al.l (120021 ). iRiess et al 



( l2004bl ). I Johnson and Crottd (120061 ) and lSuUivan et al.l (120051 ) . among others Most of the ex- 
isting methods rely on color-color or color-magnitude diagrams for supernova classification. 



In our method, we consider five possible supernova types ("normal" la (IBranch et al. 



19931 ). Ibc, IIL, IIP, and Iln). We make use of the best currently available supernova multi- 
color lightcurve templates for each type. When improved supernova templates are available, 
they can be easily worked into the method. We calculate the probability that a given 
supernova candidate with photometric data {-D^}, where i is the index for the number of 
observational epochs, and redshift z is a Type la supernova. By virtue of the Bayes theorem, 
this probability is given by: 



P(Ia|{A}, z) 



/g-P({A}, z\9, la)P{9, la)d9 
j:^JgPi{D,},z\e,T)Pi9,T)de 



(2) 



where z is the measured supernova redshift; 6 are the parameters that characterize a given 
supernova type; {-Dj} are the data in both F850LP and F775W; P{{Di} z\9, T) is the proba- 
bility density to obtain data {-Dj} and redshift z for supernova type T; P{9, T) contains prior 
information about type T supernovae; and the denominator contains the normalization (the 
sum) over all five supernova types T considered. The parameters 9 = {z, t^is, s, M, Ry, Ay) 
are: z is the true supernova redshift; t^m is the time difference bet ween the dates of max i- 
mum light for the template and the data; s is the stretch parameter (jPerlmutter et al.lll997l ). 
which parametrizes the width of the light curve (if T = la); M is the absolute magnitude in 
the restframe i?-band at maximum light; and Ay and R y are the Cardelli-Clayton-Mathis 
interstellar extinction parameters (jCardelli et al.l Il998l ). We marginalize (integrate over) 
these parameters as described below. 

Suppose that we have a photometric template, {D{9,T)i}, for the expected light curve 
for a supernova of type T at a given redshift, z. In this work, we use the templates from P. 
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E. Nugentlfl, which extend both into the UV (below 3460 A in the supernova rest frame) and 
into far red and IR (above 6600 A in the supernova rest frame) regions. Is it assumed that 
the measured hght curve flux, {-Dj}, can fluctuate from the template {D{9, T)j} according to 
Gaussian statistics. It is also assumed that the probability of measuring redshift z fluctuates 
around a mean z according to Gaussian statistics as well. Therefore, 

P({A},^|^,T) = ^^^^W^ TT ^ ' (3) 



where 6Di are photometric measurement errors for epoch i, and 6z is the measurement error 
for the redshift z. Note that we assume no errors on the supernova templates themselves; 
we take them to represent the best currently available knowledge of the supernova behavior. 
However, it is also worth noting that various parameters that characterize a given template 
[e.g., the peak restframe -B-band magnitude, the stretch parameter for la's, etc.) are varied 
as described below, thus effectively representing some template variations. 

The prior P{6, T) contains all the available information about the behavior of type T 
supernovae, expressed in terms of parameters 6. We assume that all constituents of can 
be divided as follows where tdiff, z and T, M, Ry and Ay and s are independent. 

P0, T) = P{td,s\z, T) P{M\z, T) P{s\z, T) P{Ry, Ay\z, T) P{z, T). (4) 

The assumed independence of the parameters is certainly an oversimplification. For example, 
one would expect the stretch and magnitude parameters to be correlated (although the 
true values of these two parameters should be independent of tdifr, Rv and Ay). Ignoring 
the correlation might conceivably lead to an overestimation of the probabilities for very 
bright Type la's with a small stretch parameter, or very dim Type la's with a large stretch 
parameter. However, we are exploring every possible combination of stretch and magnitude 
parameters; the "correct" combination should naturally be a better "fit" to the data, thus 
acquiring a larger weight than all the other ones. 

The prior P{z, T) includes the relative rates of the various supernova types as a function 
of redshift. Unfortunately, these rates are not well known, especially at high redshift. We 
will thus consider three different models for the ratio of the CC supe rnova rates to the la 



supernova rates. The models are based on lDahlen and FranssonI (119991 ). and shown in Fig. [31 



They correspond to three different v alues of the characteri s tic tim e delay parameter r: r 



1 Gyr, 2 Gyr, and 3 Gyr. Based on iDahlen and FranssonI (119991 ). we will also assume that 



the relative (rounded-off) fractions of the CC supernovae are fibc = 0.27, fuL = 0.35, fup 
= 0.35, and = 0.02, for all three models, regardless of the redshift. 



^See |http: / / supernova.lbl.gov/~nugent/nugent_templates.html 
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Fi g. 3. — The three models for the ratio of the CC/Ia rates as a function of redshift based 



on 



Dahlen and FranssonI (jl999l ). The sohd hne is for the time delay parameter r = 1 Gyr; 



the dashed line is for r = 2 Gyr; and the dashed-dotted line is for r = 3 Gyr. 

Note that the usage of these models does not bias our answer in any way, as we are 
not making any assumptions about the absolute rates of supernovae, but only about their 
relative rates. If we assume all three models to be equally likely, then the probability density 
P(T) of observing a supernova of type T for assumption n about the relative rates of the 
CC to la supernovae is given by: 

P{z,T)= (5) 

where Rn{z,T) is the rate of type T supernovae for model n, and Nmodeis = 3. 

The difference in the dates of maximum light between the template and the data, t^m, 
can also take on any value, making the prior P{tdm\T) flat. In practice, we shift the relative 
dates of maximum between the measured and the template light curves by increments of 
one day. The marginalization of this parameter thus amounts to a sum over a finite number 
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-/.max j-min ' 

''diff "-diff 



(which we take to be 160) of such shifts. 

P{Um\z,T) -- 

where the maximum f^^^ and minimum set the hmits on tdiff- 
The priors on P{M\z^T) and P(s|^, la) are taken to be Gaussian: 

P{M\z,T) = —= . 



(6) 



(7) 



P{s\z, la) 



27i6s 



A table of the mean magnitudes M and the standard deviations (5M, as well as the values 
for th e mean stretch s and the standard deviation 6s, are given in iKuznetsova and Connolly 
( 120071 ) ■ For reference, we extract the mean magnitud es M in the restfr a me -B -band from P. 
E. Nugen10, and the standard deviations, 5M, from Richardson et al. (2002). The stretch 
parameters are extracted from ISuUivan et al.l (120061 ). Note that for non-la's, a complete set 
of "virtual" values for the stretch pa rameters are inserted into Eqn. [5] and then marginalized 
with a flat prior (see Appendix B in IKuznetsova and ConnoUyl (120071 )). 



The effects of interstellar extinction are difficult to parametrize due to lack of generally 
accepted models for the behavior of the Cardelli-Clayton-Mathis parameters Ay and Ry- We 
compromise by considering a case of no extinction and two cases of extinction with a moder- 
ate value of Ay = 0.4 and two different values of Ry, 2.1 and 3.1. The mathematical frame- 
work used in the analysis easily allows for the implementation of real distributions for Ay 
and Ry, on ce they become standardized. It is known that in simulations Ay is sharply peaked 
near (e.q.. lHatano et al.l (119981 ) : lfor more recent treatment, see also Riello and PatatN2005h ): 
therefore, not considering very large values of Ay is reasonable. All three cases {Ny = 3) 
are considered equally possible. In other words, we take: 



P{Ry,Ay\z,T) = — 

lyy 



(9) 



It is certainly a simplified extinction model; however, it appears to be sufficient as demon- 
strated by the largely successful t yping of known T ype la candidates in two such diverse 
samples as the 73 SNLS-identified (lAstier et al.l 120061 ) Type la's and the gold and silver la's 



See http://supernova.lbl.gov/~nugent/nugent_templates.html 
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in the HST GOODS data (iKuznetsova and ConnoUyl 120071 ) . The method correctly identi- 
fied 69 out of the 73 SNLS Type la's. For the remaining four candidates, at least one filter 
band included wavelengths outside of the well-understood optical range in the supernova rest 
frame. It also correctly identified 7 out of 8 gold and silver la's, and 5 out of 5 gold and silver 
CC's. Another consideration to note here is that extinction primarily affects the measured 
magnitudes, and our model already takes into account wide variations in the magnitudes 
(Eqn.lH). 

Putting everything together, we see that the numerator of Eqn. [2] is given by: 
P({ A}, ^1^, la)P{0, la)d9 = 



E 



2tt5z 
As 



-I -^models -i Mmax A 1\ f 

y^fr'^'^ Ri(z,T) ^ ' ^ V2^6M 

Z—il=l "tv^; ; n=l n„=l M=Mmin 



j-max j-min 

''diff '■difl 



'^'"p^^" exp(- 



n 



25D? 



for la's, and for types T' that are non-la's, it is: 
P{{Di},z\e,T')P{e,T')d6 = 



(10) 



E 



Az 



Nrr 



M„ 



2tt6z 



E^models R 
1 = 1 ^' 



— y Rn{z,T')—y AN, y 



AM (M-Mf 



2ttSM 



Miff 



•.pochs exp(- ^ 

^\ V2^5Di 
1=1 * ' 



diff 
j.min 
Miff 



(ii; 



In Eqns. [10] and [TTl we marginalize over parameters 6, approximating the integration by 
summation. The range of redshifts [zzmin, Zmax] is taken to be from to 1.7 in the denomi- 
nator of Eqn. [21 and over a bin of interest in the numerator (this point will be explained in 
more detail later in this section), and we take Az = 0.05. The mean values of z and the error 
on the z, 6z, are given in Tables [H, [5l [6l and [7] for the candidates used in the analysis. Atjiff 
is one day, and AA^^, = 1. We sum M from M^m = -36 M to Mmax = with a total 

of 12 steps, and we sum s from Smin = 0.65 to Smax = 1-3 in 14 steps. For non-Type la's, a 
complete set of "virtual" values for the stretch par ameters are inserted into Eqn. [51 and then 
marginalized with a fiat prior (see Appendix B in IKuznetsova and ConnoUyl (120071 ) ) . 



The probability that a^^ candidate is a Type la supernova belonging to the j^^ redshift 
bin, \_Zj iQiu^y^^ Zj iipp^y^^^ is thus. 

i?r;: /^^({ ai, z\9, u)Pie, la) de 



Et iT dz A}, z\d, T)P{e, T) dd 



(12) 
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Let us now introduce the following variables: 

» Nj is the total count of the candidates contributing to the j^^ redshift bin. 

» Pj' is the Bayesian probabihty for each candidate a in the j^^ redshift bin [a 
[l,...,iV,.]). 



{P"}j is the full set of probabilities for the candidates in the j*^ redshift bin. 
dj is the most likely number of la candidates in the j*^ redshift bin. 



Our goal is to find dj, as well as the error on this number, given A^^ and -P"'s. 

If Nj is large, say of order 100 (which is the case for our Monte Carlo samples), then dj 
can be simply evaluated as: 



a=l 



where the uncertainty on dj is given by the square root of the binomial and Poisson variances: 

Ad, 



\ 



Y^p^{i-p^) + Y.pf. (14) 



a=l a=l 



Note that if all of the probabilities P^ were 1 {i.e., the candidates were all known to be 
Type la supernovae), using Eqn. [13] would amount to a simple counting of the number of 
candidates, and Eqn. [H] would become the usual \/N~j error for a large number of events 

For a small number of events, Nj < 10, which is typically the case for our data samples, 
using Eqn. [T3l and [Ml would be incorrect. A more sophisticated approach is needed. Let us 
define a variable Xa such that = 1 if the a^^ candidate is indeed a Type la and = if 
it is not, so that there are kj = ^^Li^a Type la's in this bin. The probability to obtain dj 



is given by: 



P{d,\{P-},) = J2 Pidj\{Xa})P{{Xa}\{P''h) = (15) 
{xa} 

^ p(K}M,)P(^,)^(^.)^^ "^'^ ^ ^ 



{Xa} 
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where the sum on dj can, in principle, extend to arbitrarily large values (for example, if Nj 
= 2, there is still a small but non-zero probability that dj can be 100). We will assume a 
fiat prior for P{dj), in which case the denominator integrates to unity. 

The first term in Eqn. [T6]is a normalized Poisson distribution for the expected dj number 
of events while kj = X^aii events are assumed to be in the j^^ bin: 

P{{xa}\dj) = ^ ^ ^ , where kj = 2_^Xc,. (17) 

^' a=l 

The term P{{xa}\{P°'}j) in Eqn. [16] is the probability that certain supernovae do or do not 
occupy the j*^ bin. This probability is simply: 

a=l 

Because we have no way of knowing a priori which candidate belongs in the j*^ bin, we 
must sum over all possible {Xq,}'s: 

{Xa} ^' a = l 

To obtain the best estimate for dj, we must maximize P{dj\{P°'}j) given in Eqn. [191 In 
practice, this is done numerically for a range of test rf^'s from to some maximum djmax 
(we arbitrarily tak;e it to be 50) to find out which dj maximizes the probability. 

Let us consider an example. Suppose that we have two supernovae in a given bin, with 
probabilities of being la's given by P^ = 0.8 and P^ = 0.9. The possible permutations of 
XaS would be (0,0), meaning that neither candidate is a Type la; (0,1) and (1,0), meaning 
that only one candidate is a Type la; and (1,1), meaning that both candidates are la's. Then 
we need to maximize 

d^e-"^, , d^e-'^ , , d^e-'^ , , rf^ "'^ , , 

-^(1 - 0.8)(1 - 0.9) + ^p0.8(l - 0.9) + ^p(l - 0.8)0.9 + ^pO.8 x 0.9 (20) 

as a function of d. For this particular example, the best estimate for the number of Type 
la's is in fact 1.68lQ;5g, where the errors are estimated as described below. 

To evaluate the uncertainty on dj, we find the 68% confidence regions for dj, \dj — Ojiow, 
+ o-j/iig/i], by solving: 



16%= P{d,\{P-},)d{dj)= P{d,\{P-},)d{d,) (21) 

''O J d+o-j high 
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In the case where dj « 1, we set ajiow = and find CTjhigh by satisfying: 

POO 

32%=/ P(d,|{P"},Kd,). (22) 

We assume that all candidates whose redshift is within ±3 Sz of the j*'' bin's boundaries 
(where Sz is the uncertainty on the candidates' redshift, listed in Tables HI [5l [6l and[7l) will 
contribute to this bin. Note that in this formulation, a single candidate with a poorly known 
redshift may have a probability distribution that spans several redshift bins. 

We calculate P(Ia| {-Dj}, z) for all 57 candidates. If a given candidate's P{{Di}, z\9, T) P{9, T) 
is less than 10~^^ for all types T , it is considered to be an "anomaly" and is excluded from 
further consideration. The 10~^^ cut was chosen because it is much smaller than the values 
calculated for simulated supernovae in the Monte Carlo. This method thus excludes any 
need for the often subjective and time-consuming decision on whether or not a candidate 
might be a supernova of a given type; all dubious candidates are weighted appropriately and 
left in the sample for the probability to decide. 

It is a good s anity check to exarn ine the values of P{\ci\{Di},z) for the gold and silver 



la candidates from lRiess et al.l (l2004al ). Tables HI [5], O and[7]list P(Ia|{Dj}, z) (with [zjiower-, 
Zjupper] = [0.0, 1.7]) for all of the candidates. Several candidates have "N/A" listed for 
P(la.\{Di}, z): these are the "anomalous" candidates, as described above. It is apparent 
that the gold and silver la candidates are among the largest contributors to a given redshift 
bin. All but one of them, SN-2003eb, have probabilities > 0.8. SN-2003eb has only two 
epochs (epochs 4 and 5 of the GOODS dataset) with "appreciable" SNR (> 10) in both 
F775W and F850LP bands. One silver la candidate, SN-20031v, appears to have a rare 
residual cosmic ray contamination in the F775W band, making it appear inconsistent with 
any of the supernova types considered. Three silver core-collapse supernovae, SN-2002kl, SN- 
2003bb, and SN-2002fz have the probabilities of being la's of 0.3, 0.8, and 0.9 respectively. 
They are in fact most consistent with being Iln's; however, because the fraction of Iln's is 
heavily de-weighted among CC supernovae {fun = 0.02), their resulting P{la\{Di}, z) are 
higher than one would have expected. How much do our assumptions about the fractions 
of various supernova types among the CC supernovae infiuence our answer? As we will see 
in Section 13.3. H if we assume that all CC types are equally likely and that the ratio of the 
CC to la rates is redshift independent, the changes to our final results are within the quoted 
uncertainties. 

Another sanity check is to make sure that the candidates with low P(Ia|{Dj}, z)'s are 
not all of a particular class {e.g., Ibc's). We have verified that indeed they are not. 

It is worth noting that variable objects other than supernovae, such as AGNs, are 
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selected during the first selection stage. If some of these objects also pass the second selection 
stage, they are unlikely to bias the results significantly, as the specifically designed cuts in 
the third stage would likely reject such candidates. As an extra check, we verified that 
none of the candidates listed in Tables IH El El and [7] that are close (within 3 pixels) to the 
core of t heir host galaxies have a matching x-ray-b right object in the Chandra Deep Field 



catalogs (lAlexander et al.ll2003l : iRosati et al.ll2002l ). The only questionable candidate that 



might have a matching object is candidate #3 in Tabled however, its P(la\{Di}, z) never 
exceeds ~10~^ for any redshift bin considered. 

In order to estimate dj^s, one must select some kind of redshift binning. One must 
be careful about the selection of the redshift bins in an analysis whose goal is to estimate 
the supernova rates, because the use of binning averages the behavior of the rates over the 
width of the bin. However, the uncertainty in the candidates' redshifts forces us to use finite 
bins - or, in other words, it does not make sense to use infinitely narrow bins when there 
is significant uncertainty in the candidate redshifts. For our analysis, we choose the width 
of the bins to be Az = 0.1. Table [8] lists the numbers of observed candidates in these bins, 
as well as their uncertainties, for the four samples listed in Table [2] [dj^ refers to a number 
of candidates in the j^^ redshift bin for the m*^ sample). All the uncertainties refiect a 
68% confidence region. In order to calculate the total numbers of supernovae, dj, we use 
the procedure described above on the combined candidates from all four samples. In other 
words, the total dj is not a trivial sum of the probability distributions of the c?J"s. 



3.3.1. Sensitivity to Varying Priors 

As usual in Bayesian analysis, the errors on the observed number of supernovae dj calcu- 
lated as described in section [3^ are a combination of statistical and systematic uncertainties. 
However, to gain an appreciation for the effect of the prior assumptions on the final result, 
we compute the change in dj^s by varying the calculation of P(Ia|{Dj}, z) from Eqn. [2] in 
three different ways: 

• Large extinction: In section 13. 3^ we considered three discrete cases for extinction: 
no extinction, (Ay, Ry) = (0.4, 2.1), and {Ay, Ry) = (0.4, 3.1). We now add the 
case of {Ay, Ry) = (1.0, 3.1) to the extinction prior, and consider it to be equally 
likely as the cases of no extinction and moderate extinction. It is in fact known that 
a value of Ay = 1.0 is much less likely than, say, an Ay = 0; however, it is in cases 
of strong extinction that the overlap between the magnitude phase space of la's and 
CC's becomes the largest. 
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Table 8: The best estimate (i.e., the most probable) number of la's, c?™, in = 0.1 redshift 
bins (j = [1,..,17]), for the four samples hsted in Table [2] (m = [1,..,4]). The total numbers 
are the results of applying the counting procedure described in the text to the combined 
candidates from all four samples (in other words, the total probability distribution is not a 
trivial sum of the probability distributions for the four samples). All the uncertainties reflect 
a 68% confidence region. 
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Overluminous Ibc's: In iRichardson et al.l (120061 ) . it is pointed out that there may 



exist a sub-class of type Ibc supernovae whose mean restframe 5-band magnitudes 
are much closer to those of normal la's, with M = -20.08, 6M = 0.46. We add these 
supernovae as one more type to our list of supernova types considered, assuming that 
fibc = 0.18 for normal Ibc's, and 0.09 for the overluminous ones. 

Flat ratio of the CC to la rates, all CC types equally likely: Instead of using 
the redshift-dependent models for the ratio of the CC to la supernova rates, we now 
assume that the ratio is redshift-independent, and taken to be 2.15, which is roughly 
the average of the models shown in Fig. [7]). We also assume that the relative fractions 
of the CC supernovae are all 0.25 {fee = 0.25); or, in other words, that all classes of 
the CC supernovae are equally likely. 



The considered alternative priors are deliberately taken to be such that the effect on 
P{la\{Di}, z) should be the most dramatic, without too much regard for whether or not 
such priors are realistic. Table [9] lists the changes in dj relative to the values specified in 
Table [8] as a result of using the alternative priors listed above. 

It is clear from Table that none of the alternative priors considered leads to a change 
in the mean that goes beyond the estimated errors in Table [HI 



The Rates Calculation 



Next, we compute the expected number of candidates in the j redshift bin whose center 
is Zj, given a volumetric Type la supernova rate in the supernova rest frame, rv,ia(z), as a 
function of redshift z. The expected number of candidates is different from the measured 
dj^s: it is calculated entirely based on Monte Carlo simulations of SNe and a given rates 
model. 



dT- = Az;-^^^^iz,] 

^ ^ 1 + Zi An dz^ ^' 



■'^M^[zj) + j—Tcc{zj)ecc[zj) 

rv.iAZj) 



(23) 



where Azj is the width of the redshift bin; B is the survey area covered; dV/dz is the 
comoving volume computed assuming a ACDM cosmology with = 0.7, ^Im = 0.3, and 
Ho = 100 X h7o (km/s) Mpc~^; Tia(^) and Tcc(^) are the control times for la and non-la 
candidates, respectively; eia and ecc are the efficiencies of the stage 3 selection for la and 
non-la candidates, respectively; and rv,cc{z) / rv,ia{z) is the ratio of the non-la supernova 
rate to the la supernova rate. Once again, the appearance of this ratio does not bias our 
results, since we do not make any assumptions about the absolute Type la rate. The control 
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Table 9: A change in the estimates for the numbers of la's, dj, as a resuh of using alternative 
priors for the 6 parameters, as described in the text. Listed are the differences between the 
dj obtained for the alternative parameters and for the default priors, in Az = 0.1 redshift 
bins (j = [1,..,17]), for the combination of the four samples listed in Table [2l 
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time in Eqn. [23] enters with a factor of (1 + z). This is a consequence of the fact that it is 
calculated in the observer frame, as will be described later. 

The control time T is defined as the time during which a supernova search is poten- 
tially capable of finding supernova candidates. In order to calculate it, we simulate HST 
observations of Type la and non-Type la supernovae at redshifts up to 1.7, with the same 
sampling and exposure times as those of the real data. By shifting the observing grid along 
the light curves, we calculate the weighted sum of the number of days during which a given 
supernova could be detected. The weight factors are obtained from the stage 1 efficiency 
parametrization; it is also required that the light curves satisfy the stage 2 SNR require- 
ments. Therefore, stage 1 and 2 supernova selection efficiencies are naturally built into the 
control time calculation. However, the stage 3 selection efficiency is not part of the control 
time calculation, and must therefore be computed separately. Calculating the area of the 
survey is straightforward using a Monte Carlo approach. The calculation of the control time 
and the survey area is given in section 14.11 

The stage 3 selection efficiencies eia and ecc must be calculated for the candidates that 
passed the control time requirements, and thus satisfy both stage 1 and 2 cuts. We create a 
Monte Carlo sample simulating real supernova candidates of five different types, and apply 
stage 1 and 2 cuts to them. We simulate both la and non-la candidates and calculate the 
number of candidates as we would for real data. This procedure is described in detail in 
section 14. 2[ 

The errors on the expected dj^^ are a combination of statistical and systematic un- 
certainties. Apart from the uncertainties inherent in the calculation of P{la.\{Di}, z), the 
dominant systematic uncertainties come from two sources: estimating the variation in the 
control time for la's for values of the lightcurve timescale stretch, s, other than 1, and esti- 
mating the effect of varying the ratio of the rates rv,cc{^) / rv,ia.{z) . The former is described 
in more detail in 14. H and for the latter we use two models described in section 13. 3^ for r = 
1 Gyr and 3 Gyr. 

4.1. The Control Time and Search Area Calculation 

Let us start with describing the calculation of the control time and search area, T 
and G from Eqn. [231 The control time is the time during which a supernova search is in 
principle capable of finding supernova candidates on the area covered. For the GOODS 
fields, the orientation of the tiles is such that a candidate is not necessarily accessible for 
every search epoch due to edge effects (see Fig. [T]). For example, for sample 1 from Table [21 
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a given location may only be covered by epochs 1, 3, and 5 (but not by epochs 2 and 4) 
of the North GOODS dataset. In both our control time calculation and in the search area 
calculation, we thus consider all of the possible epoch permutations at each location: 31 
possible permutations for samples 1,3, and 4 and 15 possible permutations for sample 2. 

We perform separate control time and search area calculations for the four samples listed 
in Table [21 however, the approach is the same. For the control time calculation, we make 



use of the simulation described in some detail in Appendix A of iKuznetsova and Connolly 



( 120071 ). We use it to create simulated HST observations in both F775W and F850LP bands 
for Type la supernovae of stretch 1, as well as for non-Type la supernovae, at redshifts up 
to 1.7 with an increment of 0.1. Separate sets of observations are generated for each possible 
permutation of the available search epochs, for each of the four samples. For example, for 
a supernova from Sample 1 that happens to be present in every one of the North GOODS 
epochs, there will be five simulated search observations and a single reference observation. 
We use typical epoch separations and exposure times for a given sample. The observations 
are realized using an aperture exposure time calculator with a 0.1" radius. We initially set 
the explosion date of the supernova on the last date of the available search epoch observation 
set {e.g., for the supernova example mentioned above it would be on the date the last of 
the North GOODS data were taken). The observing grid for the search observations is then 
shifted by one day, and the procedure is repeated Nshifts = 350 times (that is, spanning 
approximately a year, which is the longest separation between the search and reference data 
for our data samples). For each such shift, we require that the simulated data satisfy both 
the stage 1 and stage 2 requirements listed in Table [31 The resulting control time thus has 
stage 1 and 2 efficiencies automatically included. It is given by: 



hifts 



E 

k=l 



Uil-eliSNR,)) 



Ck, (24) 



where the sum is over all the shifts, Ngp is the number of available search epochs (in the ex- 
ample considered above, Ngp = 5); is a function of the i^^ subtraction's SNR, parametrized 
as in Eqn. [H and is a binary quantity 

f 1 , if fc*^ shift configuration satisfies stage 2 requirements 

\ , if fc*^ shift configuration does not satisfy stage 2 requirements 

that assesses whether a given configuration has enough epochs with sufficient SNR for the 
stage 2 selection. 



We repeat the control time calculation for la's with the lightcurve timescale stretch 
values of s = 0.65 and s = 1.30, weight the results by the probability of obtaining such 
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stretches taken from Eqn. [HI and take the larger error between the control time computed 
for these stretch parameters and that computed for a stretch of 1 as a measure of the 
systematic error on the control time for la's. For reference, Table [TU] lists the control time 
as a function of redshift for both the nominal stretch of 1 and for the stretch of 0.65 and 
1.30, for the configurations in which a supernova candidate is assumed present on all of the 
search epochs. 
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0.40 


0.59 


0.46 


0.32 


0.49 


0.32 


0.27 


0.37 


1.2 


0.63 


0.53 


0.69 


0.48 


0.35 


0.56 


0.42 


0.27 


0.47 


0.32 


0.26 


0.36 


1.3 


0.60 


0.49 


0.67 


0.46 


0.30 


0.55 


0.38 


0.22 


0.45 


0.32 


0.25 


0.36 


1.4 


0.57 


0.42 


0.64 


0.44 


0.23 


0.53 


0.37 


0.17 


0.43 


0.31 


0.20 


0.36 


1.5 


0.56 


0.36 


0.62 


0.39 


0.15 


0.48 


0.32 


0.13 


0.40 


0.31 


0.17 


0.35 


1.6 


0.51 


0.24 


0.59 


0.34 


0.09 


0.44 


0.27 


0.09 


0.37 


0.26 


0.11 


0.34 


1.7 


0.47 


0.17 


0.56 


0.21 


0.04 


0.38 


0.25 


0.06 


0.33 


0.25 


0.07 


0.31 



Table 10: The Type la control time in years as a function of redshift, for the configurations 
on which a supernova candidate is assumed to be present on all of the search epochs. The 
control time is given for three different values of the stretch parameter s: 1 (nominal), 
0.65, and 1.30. Note that this control time has the stage 1 and 2 efficiencies built into the 
calculation. 

Calculating the search area is non-trivial because of the complicated orientations of 
the GOODS tiles, as well as the overlaps between the tiles (see Fig. [1]). In addition, the 
search area must be calculated separately for all of the possible epoch configurations, as 
described above. We perform this calculation using a Monte Carlo method. First, we 
create a 300x300 point grid between the minimum and maximum right ascensions [a) and 
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declinations (6) covering the entire North or South GOODS area [e.g., from a = 12:35:34.85 
and 6 = 62:4:59.45 to a = 12:38:14.7 and 6 = 62:23:36.78 for epochs 1, 3, and 5 of sample 
^ 1). Then, for a given epoch, and for each point i on the grid, we check whether this (a^, 
6i) belongs to any of the images that were used to make subtracted data for this epoch. In 
other words, we convert (ttj, 6i) into image coordinates {xj, yj), and check that: (a) the 
point falls within the confines of at least one search/reference image pairs; (b) it does not 
fall into the gap between the two ACS chips on the search image; and (c) it does not fall on 
a known bad pixel or a pixel that has been masked off for any other reason {e.g., due to a 
residual cosmic ray contamination) on either image, although because of the drizzling there 
are very few affected pixels. If all of these requirements are satisfied, the point is counted 
toward the area calculation. Once counted, a given point can never again be counted for this 
particular epoch. This avoids double-counting, an issue particularly important since most 
GOODS tiles overlap at least somewhat with their immediate neighbors, and a point with a 
given (oj. Si) may well be present on several images. A separate accounting of the number 
of points is kept for each epoch permutation. For example, let us suppose that the number 
of points that cover all five of the GOODS North epochs is ai, and that the number of total 
points tried in the grid is Ai, then the area corresponding to this configuration is Sai/Ai, 
where S is the of the entire North GOODS survey. 

Figure H] shows the resulting product of the control time and surveyed area (G T{z) = 
Yl^=i Ti{z), where n is the number of all possible permutations) for stretch 1 Type la's, as 
a function of redshift for the four different samples in Table [21 There are several interesting 
features in Fig. |H First, the product of the control time and area tends to decrease with 
redshift. This is a consequence of the fact that it becomes more difficult to satisfy the stage 
2 SNR requirements for higher redshift (dimmer) supernovae. Second, for a given redshift, 
the product is smaller for sample #2 than for sample #1, a consequence of the fact that 
there are only 4 search epochs in sample #2 vs. 5 search epochs in sample #1. Third, the 
product is distinctly smaller for the South GOODS samples (samples #3 and #4) than for 
either of the North samples (#1 and #2), a reflection of the fact that for these samples 
we are forced to use references made from two of the GOODS South dataset's own epochs. 
Finally, the product is smaller for sample #4, which uses epochs 1+2 of the GOODS South 
dataset as its reference data, than for it is for sample #5, which uses epochs 4+5. This is 
simply because the rise time of a supernova is smaller than its decline time. 



- 32 - 



X 10 



0.225 j- 

^ 0.2 E- 
*J).175 

XO.15 ^ 
*aP.125 

I 0.1 ^ 

§).075 E- 

<S 0.05 E- 

0.025 





0.2 0.4 0.6 0.8 1 

Redshift 



Sample 1 
Sample 2 
Sample 3 
Sample 4 



1.2 1.4 1.6 



Fig. 4. — The product of the control time and surveyed function of redshift for the 

four samples listed in Table [21 calculated for a stretch 1 la. The dashed line is for sample 
#1, where the 5 GOODS North epochs were used as the search data, and the combined 
2004 ACS sample, as the reference. The dotted line is for sample #2, where the 4 epochs 
of the 2004 ACS dataset were used as the search data, and the combined GOODS North 
data, as the reference. The dashed-dotted line is for sample #3, where all 5 epochs of the 
GOODS South sample were used as the search data; and the combined epochs 4+5 of the 
South GOODS dataset, as the reference. The solid line is for sample 7^4, where all 5 epochs 
of the GOODS South sample were used as the search data; and the combined epochs 1+2 
of the South GOODS dataset, as the reference. 



4.2. Calculating eja and ecc 

In order to determine the efficiency of the stage 3 selection, we generate four Monte 
Carlo datasets simulating the data from the four datasets listed in Table [2] (in other words, 
they have the same sampling, exposure times, etc., as the data). Each Monte Carlo dataset 
contains 500 candidates for each of the 5 supernova types considered (la, Ibc, IIL, IIP, and 
Iln). The redshifts of these candidates are drawn from a Gaussian distribution that uses the 
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redshifts and redshift errors of the real data events; the exposure times and samphng intervals 
also mimic those of the real data. The candidates' rest-frame i?-band magnitudes, stretch 
(for Type la's), and extinction parameters are drawn from the appropriate distributions used 
in Eqns. [10] and [TTJ The time period between the date of explosion and the first observation 
is randomly drawn from a flat distribution. In addition, because we are simulating a dataset 
as it would appear by the time it is ready for the stage 3 selection, we impose the same 
selection requirements from stages 1 and 2 on these Monte Carlo events as we do on the real 
data. 

After these Monte Carlo samples are generated, we calculate the number of candidates 
in each redshift bin. Dividing this number by the total number of the generated la's yields 
the efficiency ej'j^, for redshift bin j for Monte Carlo dataset m. Similarly, the efficiency for 
non-Type la candidates, e™cc' defined as the sum of the probabilities of the no n- Type 
la candidates divided by the total number of all generated non-Type la supernovae. The 
values of e^jjs range from ~10 to 90%; and the values of e^ccS, ~3 to 50%, depending on 
the redshift bin. 



4.3. Comparison of Expected and Observed Numbers of Supernovae 

We can now put everything together and compute the expected numbers of supernovae 
for a given model of the Type la supernova rates using Eqn. [23l We calculate the observed 
numbers of supernovae for redshift s z < 1.7, a s well as the expected numbers of supernovae 



for the two models considered in iPain et al.l (120021 ): a redshift-independent one and one 



evolving with redshift as a power law. We perform a least-squares fit of the observed numbers 
of supernovae to the predictions for both models. We also perform a maximum likelihood 
fit and compare the results. 

• Redshift-independent rate. Assuming the rate is fiat as a function of redshift, we 
obtain the best-fit value of rv,ia = (1-1 ^a2) x 10"'' Ni^/ (year Mpc^ h^^) , with a = 
11.5 for 16 degrees of freedom. Figure [5] (left) shows the resulting distribution of the 
predicted and observed numbers of supernovae. The errors on the predicted numbers 
of supernovae are a quadratic combination of the statistical and systematic errors (the 
statistical and systematic errors are comparable). The maximum likelihood method 
yields rv,ia = i^-^tol) x 10"^ A^ia/(year Mpc^ ^to ), consistent with the method. 

• Rate evolving as a power law with redshift. Assuming the rate is varying as a 
function of redshift as (1 + ^)", using the fitter we obtain the best-fitting value for a 



0.2lQ'y with a = 11-4 for 15 degrees of freedom. This is consistent with lPain et al. 
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(120021 ) . who found a = 0.8 ± 1.6. Note that the fit results are also consistent with the 
a = case that was considered above. The maximum likelihood method yields a = 

-0.41:?. 




Fig. 5. — Left: The total observed candidates for the 4 samples are plotted as a function 
of redshift (filled circles). The errors on the observed candidates are given in Table [HI 
The predicted number of candidates has been computed assuming a redshift-independent 
volumetric Type la rate of rv,ia(^) = 1-1 x 10""^ A^ia/(year Mpc^ ^70)])^ plotted as 

a dashed histogram. The shaded region around the predicted numbers indicates the range 
of combined statistical and systematic errors. The contributions from the statistical and 
systematic errors are comparable. Right: The calculated rates as a function of redshift (filled 
circles), with overplotted fit results to the fits described in the text: redshift-independent 
rate (solid line) and power-law redshift dependent rate (dashed line). Note that the plot 
does not show the rates in the first redshift bin; this is because in this bin the rates are 
effectively unconstrained on the scale shown. 



Both the redshift-independent model and the power-law model yield acceptable fit results, 
judging by the obtained x^'s (note, however, that the data points in neighboring bins are 
correlated, leading to lower per DOF). The probability p{Ax^\^DOF) = 0.1, where Ax^ 
and ADOF are the difference in the ^ind the numbers of degrees of freedom, respec- 
tively, for the redshift-independent model and the power-law model. In other words, the 
approximate probability that data would fluctuate from the redshift-independent model to 
the power-law model is 0.1. This fact indicates that our description of the experiment is 
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good at both low and high redshifts. One must note, however, that at redshifts > 1 the sam- 
ples start becoming sparser, and at redshifts > 1.4 the measurement becomes particularly 
difficult with this dataset. 



5. Comparison to Rates in the Literature 



To compare our results with those of iDahlen et al.l (120041 ). we now compute the Type 
la supernova rates in four large redshift bins, 0.2 < z < 0.6, 0.6 < z < 1.0, 1.0 < z < 1.4, 
and 1.4 < z < 1.7. Table fTTl enumerates the estimates for the number of candidates in these 
redshift bins for the four samples listed in Table [H 



Redshift bin 


d] 






4 


Total 


0.2 < z < 0.6 
0.6 < z < 1.0 
1.0 < z < 1.4 
1.4 < z < 1.7 


2A0tlil (0) 
ISAOtlil (6) 

3.23tlfr (3) 

omtl fo (0) 


1 O1+3.05 

o QK+3.47 
o.oO__j^ 25 


o.ool+i^ (0) 

(2) 

1-501^:1^ (1) 

o-oolJ:^3 (1) 


l.OOl^;! (2) 
1.0712-42 (2) 

i-7olg:i (1) 


18.33^11 

Q Q7+3.13 
o-o'-2.36 
^^++"^2 
•^•■^^-0.35 



Table 11: The best estimate {i.e., the most probable) number of la's, d^, in Az = 0.4 (0.3 
for the last bin) redshift bins (j = [1,..,4]), for the four samples listed in Table [2] (m = 
[1,.., 4]). The nunabers in parenthesis are the number of gold an d silver la's in the s ample 
from iRiess et al.l (l2004al ). that were used in the rates analysis of iDahlen et al.l (120041 ) . The 
total numbers are the results of applying the counting procedure described in the text to the 
combined candidates from all four samples (in other words, the total probability distribution 
is not a trivial sum of the probability distributions for the four samples). All the uncertainties 
reflect a 68% confidence region. 



Using all four samples, we can now compute the rates for each bin using Eqn. [231 The 
values for QT{z), dV/dz, rv,cc/rv,ia^ and z are taken in the middle of the bin. The errors on 
the rates are a quadratic combination of the errors on the number of observed la's listed in 
Table [TTl as well as statistical and systematic errors on the right- hand- side part of Eqn. [231 
The resulting rates are su mmarized in Table [T2l and plotted in Fig. [6l together with the rates 
from [Pahlen et al.[ (120041 ) and results from the literature at lower redshifts. It is apparent 
that our results are consistent with those fr om the literature: in particular, at higher redshifts 
our rates are not inconsistent with those of [Pahlen et al.[ (|2004| ). although obtaining a precise 
measure of the consistency would require a careful evaluation of the correlations between the 
samples used in both analyses. 
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Redshift bin 


rv,ia(z) 

([10-4 iVia/(year Mpc^ h:^,^)]) 


0.2 < z < 0.6 
0.6 < ^ < 1.0 
1.0 < ^ < 1.4 
lA<z< 1.7 




"J-^'-'-O.l? 
n qQ+0.25 
U.JO_Q 25 

"J- ' "^-0.30 

12+°-^'^ 



Table 12: The Type la supernova rates in the four redshift bins considered. The errors are 
a quadratic combination of the errors on (i™'s hsted in Table [TTl as well as statistical and 
systematic errors on the right-hand-side of Eqn. [231 



Note that Type la supernovae that we have considered encompass a wide range of 
magnitudes, stretch parameters, extinction possibilities, etc.. Therefore, the procedure de- 
scribed in se ction 13.31 accounts f or not only the more standard Type la's (such as those 
described in 



Branch et al 



( 1993h'). but also non-standard Type la's, such as type 1991bg 



and 1991T iFilippenko et al.l (119921 ). 1991bg-like supernovae have low values of the stretch 
parameter [s = 0.71 ± 0.05), and are typically ~1.7 magnitudes fainter in the y-band and 
~2.6 magnitudes fainter in the 5-band. Stretch values of 0.71 are certainly within the range 
of the stretch parameters we considered; as for the magnitudes, it is reassuring to note that 
the case of strong extinction {Ay = 1) did not significantly alter our results (see Table [H]). 
1991T-like supernovae are about 0.5-0.9 magnitudes brighter than normal la's, with stretch 
s = 1.07 ± 0.06. Both the stretch and the magnitude values are well within the considered 
ranges of these parameters. Note also that the fact that the Bayesian classification method 
was able to accurately type the vast majority of the 73 Type la candidates from the SNLS 
dataset, as was demonstrated in iKuznetsova and ConnoUyl (120071 ). shows that the method is 
capable of identifying Type la's in large populations that presumably include non-standard 
la's. 



It is particularly interesting to compare our rate results with that of lDahlen et al.l (120041 ). 
That study also analyzed the GOODS sample, but there are important differences in our 
methods, as pointed out above (Sec. 1). While our results are in statistical agreement, our 
measured rate in a given bin can differ from theirs through either the candidate counting or 
the calculation of the control time/efficiency. 



• Candidate counting: In some bins, the final count of the candidates ends up being 
about the same for both analyses, but the actual candidates are not the same. This 
is not unexpected because the techniques used for the supernova identification in the 
two analyses are quite different. Our method provides a probabilistic rather than an 
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Fig. 6. — The filled circles are the results of this work. The first three empty squares 



at low redshifts (t h e "lo w -^ results" ) are 



of ICappellaro et al.l (119991 ). iMadgwick et al.l (I2CI031). and iBlanc et al.l (120041 ) . respectively. 



rom lower to higher r e dshift s, the results 



The open u pturned triangle at z = 0.47 is from 



Neill et al.l (120061 ). The open c ircle a t z 



0.50 is from lTonry et al.l (l2003l ) . The open star at z = 0.55 is from lPain et al.l (120021). The 



open d iamonds are the results of lBarris and Tonryl (120061 ). The crosses are from lDahlen et al. 
(120041 ) (including systematic errors). The horizontal bars are estimated redshift bin sizes. 



absolute identification of each individual supernova based on its photometric measure- 
ments alone; the same probabilistic approach is used for calculating the efficiency and 
mis- ident ificat ion . 

For example, in the highest redshift bin, we have one can didate but this is from Sample 
2, which was taken after the work of iDahlen et al.l (120041 ) was published. However, the 
two high-red shift la candidates fr om Samples 3 and 4, SN-2002fx and SN-2003ak, which 
were used in lDahlen et al.l (120041 ). did not pass our stage 2 cuts. 



Control time/efRciency: A rigorous com parison of the control times is difficult due 
to the lack of tabulated control time data in lDahlen et al.l (120041 ). However, a rough es- 
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timation of the control time times efficiency factor from the data given in lDahlen et al. 



(120041 ) shows that this factor is approximately half our values for all but the highest 
redshift bin. 



6. The Star Formation History Connection 



A particularly interesting aspect of a Type la supernova rates analysis is the possibility 
of constraining the delay time between the formation of a progenitor star and a supernova 
explosion, which in turn helps constrain possible models for the Type la supernova formation. 
There are two leading models that have been considered in the recent literature: the so- 
called two-component model and a Gaussian delay model. We will now consider both of 
these models. Unlike sect ion 14.31 now that we are considering the rates we can add the low- 



redshi ft measurements of ICappellaro et al.l (Il999l ). iMadgwick et al.l (120031 1 . and iBlanc et al. 
( I2OO4J ) to our results and use the combined data in the fits. 



The two-component model (IScannapieco and Bildstrerul2005l : iMannucci et al.ll2006l ) sug- 
gests that that the delay function may be bimodal, with one component responsible for the 
"prompt" Type la supernovae that explode soon after the formation of their progenitors; 
and the other, for the "tardy" supernovae that have a much longer delay time. Following 
this model, the Type la supernova rate can be represented as: 



rv,ia(z) = Ap^{z) + Bp^{z), 



(26) 



where p*(z) is the integrated SFH and p*(^) is the instantaneous SFH. The ffist term of the 
equation accounts for the "tardy" population, while the second, for the "pr ompt" one. We 
use the parametric form of the SFH as given in lHopkins and Beaconj (120061 ): 



(a + bz) h 



70 



1 + {z/cY 



(27) 



where h^o = 0.7, a = 0.017, b = 0.13, c = 3.3, d = 5.3. 



The Levenberg-Marquardt least-squares fit of the combined data to the two-component 
model is shown in Fig. [7] (left). We obtain A = (1.5 ± 0.7) xlO^^^ yr'^Mg^ and B = 
(5. 4 ± 2.0) xlO~^ y T~^/(Mp^ yr~^)- These results are entirely consistent with those obtained 
bv lNeill et all J2006h : A = (1.4 ± 1.0) x 10"^^ yr'^M^^ and B = (8.0 ± 2.6) x 10"^ yr-V(M0 



yr ^). The of the fit is 5.4 for 5 degrees of freedom. 



Note that the results of iBarris and Tonryl (120061 ) at ^ = 0.55, 0.65, and 0.75 are some- 
what inconsistent with our best-fitting two-component model, with the discrepancy at the 
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le vel of 4.1, 3 . 2, an d 5.2 o", respectively. This can be s een from Fig. El It has been argued 



m 



Neill et al.l (120061 ) (who also noted that the results of iBarris and Tonryl (120061 ) beyond the 
redshift of 0.5 appear to be rather high) that contamination by non-Type la's is the most 
likely source of the problem. 



It was suggested in iDahlen et al.l (12004 ) and IStrolger et al.l (120041 ) that the la rate is a 
convolution of the SFH and a Gaussian time delay distribution function with a characteristic 
time delay r ~3 Gyr and a a = 0.2 r. Using the Hopkins-Beacom SFH, we find that the 
best-fitting parameters for such a model are r = (3.2 ± 0.6) Gyr and a = (0.12 ± 0.54) 
r, with a fit of 2.1 for 4 degrees of freedom. The fit is shown in Fig. 171 (right) . For 
compa rison, we also show the rate model obtained using the parameters from 
kooi l (r = 3 Gyr and a = 0.2 r). 



Strolger et al. 



-3.0 



-3.5 



-4.0 



2 

^ -4.5 



-5.0 



-5.5 



-6.0 





1 1 1 1 1 1 1 


1 1 1 1 1 1 1 




— 






. . , 1 . , . . 1 . , . . 





0.0 



0.5 



1.0 
Redshift 



2.0 




Fig. 7. — Left: The least-square fit of the two-component model to the data. The dashed 
line represents the prompt component that is proportional to the instantaneous SFH. The 
dotted line represents the tardy component that is proportional to the integrated SFH. The 
thick solid line is the sum of the two. Right: The Gaussian tii ne delay model with our best- 
fitted parameters (solid line), as well as with the parameters of IStrolger et al.l (120041 ) (dashed 
line). In both plots, the first three empty square s at low redshifts a re, fro m lo wer to higher 



redshifts, the results of ICappellaro et al.l (ll999l ). iMadgwick et al.l (120031 ) . and iBlanc et al. 



(120041 ). respectively. The filled circles are the results of this work. The horizontal bars are 
estimated redshift bin sizes. 



One of the main differences between the two-component model and the time delay model 
is the predicted behavior at high redshifts: the former predicts an increase in the rates, while 
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the latter, a decrease. From Fig. [7] and the results of the fits of our data to both models, we 
find that neither scenario can be ruled out. 



7. Summary 

We have analyzed the rates of Type la supernovae up to a redshift of 1.7 using two 
samples collected with the HST: the GOODS data, and the 2004 ACS sample collected in 
the Spring-Summer 2004 covering the GOODS North field. Using only the data from two 
broadband filters, F775W and F850LP, we applied a novel technique for identifying Type la 
supernovae based on a Bayesian probability approach. This method allows us to automat- 
ically type supernova candidates in large samples, properly taking into account all known 
sources of systematic error. We also make use of the best currently available full spectral 
templates for five different supernova types for the candidate typing, as well as for calcu- 
lating the efficiency of our supernova search, and the control time. These templates will 
undoubtedly be improved over the next several years as more supernova data becomes avail- 
able. Current and upcoming supernova surveys will not only provide a better understanding 
of individual supernova types, but may also uncover new types of supernovae, which can 
then be added to the Bayesian classification framework. Likewise, a better understanding 
of the many parameters that affect supernova observations will improve the classification 
scheme, which will result in better constraints on the measured rates. The calculations of 
the supernova finding efficiency, the control time, and the survey area are all done taking 
into account the specific observing configurations pertinent for the surveys, such as exposure 
times, cadences, and the orientations of the GOODS tiles. 

We carried out a comparison of the predicted and observed numbers of supernovae in 
redshift bins of Az = 0.1, for two different models of the Type la supernova rates: a redshift- 
independent rate and a power-law redshift-dependent rate. We find that the available data 
fit both models equally well. 



For comparison with previous work, particularly that of iDahlen et al.l (120041 ). who also 
analyzed a large subset of the data used here, we calculated the volumetric Type la supernova 
rates in four redshift bins, 0.2 < z < 0.6, 0.6 < z < 1.0, 1.0 < z < 1.4, and 1 . 4 < z < 1.7. 



We find that our results are generally consistent with those of iDahlen et al.l (l2004l ). Due 
to the larger of number supernova candidates which this Bayesian classification technique 
makes available, we obtain smaller or equal uncertainties in all the bins up to z = 1.7. In 
the highest redshift bin we obtain a larger uncertainty because the signal to noise ratio is 
generally too low to apply this technique. 
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We fitted the resulting rates to two leading models used in recent literature: the two- 
component model and a Gaussian time delay model. The former model implies an increase 
in the Type la supernova rates at highest redshifts; while the latter, a decrease. We find 
that the statistics of the present sample does not definitively discrimi nate between the two 



scenarios - only one supernova in this work and two supernovae in iDahlen et al.l (120041 ) 
contribute to the important highest-redshift bin. Significantly larger surveillance time would 
be required to arrive at a conclusive statement on the trends for the Type la rates at high 
redshifts. 

In the future, several ambitious new surveys are planned that will collect photometric 
data for thousands of supernovae in order to improve the constraints on dark energy. Indi- 
vidual spectroscopic follow up for every supernova candidate is hkely to be impractical in 
these surveys. The Bayesian classification method described here has the ability to classify 
supernovae using photometric measurements alone, and is a promising technique for these 
future surveys. 
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